Blind signal processing methods for microphone leakage suppression in multichannel audio applications
نویسنده
چکیده
This thesis examines the problem of microphone leakage, that is the interference between simultaneously active sound sources in multichannel audio applications. Despite being a common problem with which sound engineers are confronted every day, almost no signal processing methods have been proposed to address this issue. In this work, the problem is formulated for the first time in a signal processing framework. First, it formulated inside the blind source separation (BSS) context and the limitations of related methods are analysed and reported. Since, BSS methods seem to be inappropriate for this specific problem, it is reformulated as a signal in noise problem inside the well-known noise suppression framework. Based on the widely adopted close-microphone technique a novel, generalized framework for leakage suppression is derived based on a multichannel Wiener filter. The acoustic system that models the mixing process is analysed and the related room impulse responses are discerned in direct and leakage acoustic paths. The properties of the direct acoustic path, that is the close-microphone response are investigated for the first time, from a signal processing point of view as well as a room acoustics perspective. The properties of the leakage acoustic path are also analysed for the first time using room acoustic parameters. After key properties of the acoustic paths have been identified, a method for the suppression of microphone leakage in a two channel audio setup is developed based on a Wiener filter and a crude approximation of the related power spectral densities (PSDs). The performance of this method for actual recordings in real reverberant environments is more than adequate and based on these results, the method is extended for more than two sources and microphones in arbitrary arrangements. The complete method is blind and automatic, since it does not require any user input. It does not assume any prior knowledge or require training and is computationally efficient. A novel solo detection method has been developed that allows the estimation of weighting coefficients that correspond to the relative attenuation experienced by sound sources as they travel to each microphone. Combined with a new and advanced PSD estimation method based on the identification of dominant frequency bins, the related PSDs in a multichannel audio application can be identified. From these an appropriate multichannel Wiener filter for each microphone signal can be calculated, which will provide the estimated source signal at its output.
منابع مشابه
Reduction of Microphone Leakage Using Multichannel Wiener Filter
Microphone leakage is one of the most prevalent problems in audio applications involving multiple instruments and multiple microphones. Currently, sound engineers have limited solutions available to them. In this paper, the applicability of two widely used signal enhancement methods to this problem is discussed, namely blind source separation and noise suppression. By extending previous work, i...
متن کاملMultichannel Linear Prediction for Blind Reverberant Audio Source Separation
A class of methods based on multichannel linear prediction (MCLP) can achieve effective blind dereverberation of a source, when the source is observed with a microphone array. We propose an inventive use of MCLP as a pre-processing step for blind source separation with a microphone array. We show theoretically that, under certain assumptions, such pre-processing reduces the original blind rever...
متن کاملAdvancements in the Time-Frequency Approach to Multichannel Blind Source Separation
The ability of the human cognitive system to distinguish between multiple, simultaneously active sources of sound is a remarkable quality that is often taken for granted. This capability has been studied extensively within the speech processing community andmany an endeavor at imitation has beenmade. However, automatic speech processing systems are yet to perform at a level akin to human profic...
متن کاملMultichannel audio signal source separation based on an Interchannel Loudness Vector Sum
In this paper, a Blind Source Separation (BSS) algorithm for multichannel audio contents is proposed. Unlike common BSS algorithms targeting stereo audio contents or microphone array signals, our technique is targeted at multichannel audio such as 5.1 and 7.1ch audio. Since most multichannel audio object sources are panned using the Inter-channel Loudness Difference (ILD), we employ the ILVS (I...
متن کاملBlind Audio Source Separation using Short+Long Term AR Source Models and Iterative Itakura-Saito Distance Minimization
Blind audio source separation (BASS) arises in a number of applications in speech and music processing such as speech enhancement, speaker diarization, automated music transcription etc. Generally, BASS methods consider multichannel signal capture. The single microphone case is the most difficult underdetermined case, but it often arises in practice. In the approach considered here, the main so...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012